Pagerank-like algorithm for ranking news stories and news portals
نویسنده
چکیده
News websites are one of the most visited destinations on the web. As there are many news portals created on a daily basis, each having its own preference for which news are important, detecting unbiased important news might be useful for users to keep up to date with what is happening in the world. In this work we present a method for identifying top news in the web environment that consists of diversified news portals. It is commonly know that important news generally occupies visually significant place on a home page of a news site and that many news portals will cover important news events. We used these two properties to model the relationship between homepages, news articles and events in the world, and present an algorithm to identify important events and automatically calculate the significance, or authority, of the news portals.
منابع مشابه
PageRank with Text Similarity and Video Near-Duplicate Constraints for News Story Re-ranking
Pseudo-relevance feedback is a popular and widely accepted query reformulation strategy for document retrieval and re-ranking. However, problems arise in this task when assumed-to-be relevant documents are actually irrelevant which causes a drift in the focus of the reformulated query. This paper focuses on news story retrieval and re-ranking, and offers a new perspective through the exploratio...
متن کاملNews Topic Tracking and Re-ranking with Query Expansion Based on Near-Duplicate Detection
Increase of digital storage capacity enabled the creation of large-scale news video archives. To make full use of the archive, it is necessary to grasp the development and dependencies of news stories. Considering this problem, we investigate tracking and re-ranking methodologies of news stories. The archive used as a test-bed consists of more than 30,000 news stories. This paper proposes a nov...
متن کاملTop Stories Identification From Blog to News in TREC 2010 Blog Track
In 2010 Blog Track, there are two tasks including Faceted Blog Distillation Task and Top Stories Identification Task. We mainly focus on the Top Stories Identification Task. In this task, there are two issues to solve. The first issue is ranking the important news stories on the specified day, named Story Ranking Task. The second issue is named News Blog Post Ranking Task. News Blog Post Rankin...
متن کاملA Framework for Exploration of News Corpora by Actor Evolution and Interaction
We present a general framework for modeling and exploration of news corpus. The natural way to model a news corpus is as a directed graph where stories are linked to one another through a variety of relationships. We formalize this notion by viewing each news story as a set of actors, and by viewing links between stories as transformations these actors go through. We propose and model a simple ...
متن کاملA Learned Approach for Ranking News in Real-Time Using the Blogosphere
Newspaper websites and news aggregators rank news stories by their newsworthiness in real-time for display to the user. Recent work has shown that news stories can be ranked automatically in a retrospective manner based upon related discussion within the blogosphere. However, it is as yet undetermined whether blogs are sufficiently fresh to rank stories in real-time. In this paper, we propose a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013